Boosting Exploration in Actor-Critic Algorithms by Incentivizing Plausible Novel States
- Creator: Banerjee, Chayan , Chen, Zhiyong , Noman, Nasimul
- Resource Type: conference paper
- Date: 2023
Improving sample efficiency in deep reinforcement learning based control of dynamic systems
- Creator: Banerjee, Chayan
- Resource Type: thesis
- Date: 2023
NENYA: Cascade Reinforcement Learning for Cost-Aware Failure Mitigation at Microsoft 365
- Creator: Wang, Lu , Zhao, Pu , Zhang, Hongyu , Rajmohan, Saravan , Zhang, Dongmei , Du, Chao , Luo, Chuan , Su, Mengna , Yang, Fangkai , Liu, Yudong , Lin, Qingwei , Wang, Min , Dang, Yingnong
- Resource Type: conference paper
- Date: 2022
Optimal Actor-Critic Policy With Optimized Training Datasets
- Creator: Banerjee, Chayan , Chen, Zhiyong , Noman, Nasimul , Zamani, Mohsen
- Resource Type: journal article
- Date: 2022
Physics Informed Intrinsic Rewards in Reinforcement Learning
- Creator: Jiang, Jiazhou , Fu, Minyue , Chen, Zhiyong
- Resource Type: conference paper
- Date: 2022
Reinforcement learning using expectation maximization based guided policy search for stochastic dynamics
- Creator: Mallick, Prakash , Chen, Zhiyong , Zamani, Mohsen
- Resource Type: journal article
- Date: 2022
Stochastic Optimal Control for Multivariable Dynamical Systems Using Expectation Maximization
- Creator: Mallick, Prakash , Chen, Zhiyong
- Resource Type: journal article
- Date: 2022
Reinforcement Learning Based Multi-Agent Resilient Control: From Deep Neural Networks to an Adaptive Law
- Creator: Hou, Jian , Wang, Fangyuan , Wang, Lili , Chen, Zhiyong
- Resource Type: conference paper
- Date: 2021
Modelling railway traffic management through multi-agent systems and reinforcement learning
- Creator: Bretas, A. , Mendes, A. , Chalup, S. , Jackson, M. , Clement, R. , Sanhueza, C.
- Resource Type: conference paper
- Date: 2019
Reinforcement learning for constrained energy trading games with incomplete information
- Creator: Wang, Huiwei , Huang, Tingwen , Liao, Xiaofeng , Abu-Rub, Haitham , Chen, Guo
- Resource Type: journal article
- Date: 2017
Towards neural knowledge DNA
- Creator: Zhang, Haoxi , Sanín, Cesar , Szczerbicki, Edward , Zhu, Ming
- Resource Type: journal article
- Date: 2017
A multi-agent cooperative reinforcement learning model using a hierarchy of consultants, tutors and workers
- Creator: Abed-Alguni, Bilal H. , Chalup, Stephan K. , Henskens, Frans A. , Paul, David J.
- Resource Type: journal article
- Date: 2015
Learning nursery rhymes using adaptive parameter neurodynamic programming
- Creator: Walker, Josiah , Chalup, Stephan K.
- Resource Type: conference paper
- Date: 2015
Of matchers and maximizers: how competition shapes choice under risk and uncertainty
- Creator: Schulze, Christin , van Ravenzwaaij, Don , Newell, Ben R.
- Resource Type: journal article
- Date: 2015
Cooperative reinforcement learning for independent learners
- Creator: Abed-Alguni, Bilal Hashem Kalil
- Resource Type: thesis
- Date: 2014
Robot emotions generated and modulated by visual features of the environment
- Creator: Wong, Aaron S. W. , Nicklin, Steven , Hong, Kenny , Chalup, Stephan K. , Walla, Peter
- Resource Type: conference paper
- Date: 2013
Reinforcement learning, logic and evolutionary computation: a learning classifier system approach to relational reinforcement learning
- Creator: Mellor, Drew
- Resource Type: book
- Date: 2009